Panoramic Video Salient Object Detection with Ambisonic Audio Guidance
Authors
Abstract
Video salient object detection (VSOD), as a fundamental computer vision problem, has been extensively discussed in the last decade. However, all existing works focus on addressing the VSOD problem in 2D scenarios. With the rapid development of VR devices, panoramic videos have become a promising alternative for providing an immersive sense of the real world. In this paper, we aim to tackle the video salient object detection problem for panoramic videos, with their corresponding ambisonic audios. A multimodal fusion module equipped with two pseudo-siamese audio-visual context fusion (ACF) blocks is proposed to effectively conduct audio-visual interaction. The ACF block, equipped with spherical positional encoding, enables fusion in a 3D context to capture the spatial correspondence between pixels and sound sources from the equirectangular frames and ambisonic audios. Experimental results verify the effectiveness of our proposed components and demonstrate that our method achieves state-of-the-art performance on the ASOD60K dataset.
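The abstract does not specify how the spherical positional encoding is computed. As a rough illustration of the geometry it relies on, the sketch below maps a pixel of an equirectangular frame to a direction on the unit sphere, which is the kind of 3D position that could be compared with a sound-source direction. The function name and the longitude/latitude convention are assumptions made for this example and are not taken from the paper.

```python
import math


def equirect_pixel_to_unit_vector(u, v, width, height):
    """Map the centre of pixel (u, v) to a unit direction vector (x, y, z).

    Assumed convention (hypothetical, not from the paper):
    longitude runs from -pi at the left edge to +pi at the right edge,
    latitude runs from +pi/2 at the top edge to -pi/2 at the bottom edge.
    """
    # Normalise the pixel centre to [0, 1) and convert to angles.
    lon = ((u + 0.5) / width) * 2.0 * math.pi - math.pi
    lat = math.pi / 2.0 - ((v + 0.5) / height) * math.pi

    # Standard spherical-to-Cartesian conversion on the unit sphere.
    x = math.cos(lat) * math.cos(lon)
    y = math.cos(lat) * math.sin(lon)
    z = math.sin(lat)
    return (x, y, z)
```

Under this convention the centre of the frame points along the positive x axis, and every returned vector has unit length, so angular distances between pixels (or between a pixel and an audio direction) can be computed with a dot product.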
Similar resources
Efficient Co-Salient Video Object Detection Based on Preattentive Processing
Automatic video annotation is a critical step for content-based video retrieval and browsing. Automatically detecting the focus of interest, such as co-occurring objects in video frames, can ease the tedious manual labeling process. However, detecting co-occurring objects that are visually salient in video sequences is a challenging task. In this paper, in order to detect co-salient video ob...
Video Salient Object Detection Using Spatiotemporal Deep Features
This paper presents a method for detecting salient objects in videos where temporal information in addition to spatial information is fully taken into account. Following recent reports on the advantage of deep features over conventional handcrafted features, we propose the SpatioTemporal Deep (STD) feature that utilizes local and global contexts over frames. We also propose the SpatioTemporal C...
Salient Object Detection with Semantic Priors
Salient object detection has increasingly become a popular topic in cognitive and computational sciences, including computer vision and artificial intelligence research. In this paper, we propose integrating semantic priors into the salient object detection process. Our algorithm consists of three basic steps. Firstly, the explicit saliency map is obtained based on the semantic segmentation ref...
Salient Object Detection: A Survey
Detecting and segmenting salient objects in natural scenes, also known as salient object detection, has attracted a lot of focused research in computer vision and has resulted in many applications. However, while many such models exist, a deep understanding of achievements and issues is lacking. We aim to provide a comprehensive review of the recent progress in this field. We situate salient ob...
Automatic Salient Object Detection for Panoramic Images Using Region Growing and Fixation Prediction Model
Saliency detection aims to detect the most attractive objects in images, which has been widely used as a foundation for various multimedia applications. In this paper, we propose a novel salient object detection algorithm for RGB-D images using center-dark channel prior. First, we generate an initial saliency map based on a color saliency map and a depth saliency map of a given RGB-D image. The...
Journal
Journal title: Proceedings of the ... AAAI Conference on Artificial Intelligence
Year: 2023
ISSN: 2159-5399, 2374-3468
DOI: https://doi.org/10.1609/aaai.v37i2.25227